Issue 387 resolved #395

supercoder-dev · 2024-06-13T13:35:08Z

To solve the problem of the head dimension exceeding the shared memory limit, we need to add a check after the line where d_inner is calculated. If d_inner exceeds a safe maximum value, we should set it to that maximum value.
To solve the problem of the head dimension exceeding the hardware limits, we need to add a check in the __init__ methods of both MixerModel and MambaLMHeadModel classes. This check will ensure that the head dimension (d_model) does not exceed a certain limit. If it does, it will adjust it to the maximum allowable value based on the hardware.
To solve the problem, we need to add a parameter to configure the head dimension (headdim) and ensure it is set appropriately. We also need to validate the head dimension to ensure it does not exceed hardware limits. Additionally, we need to adjust memory allocation and kernel function calls to use the configured head dimension and ensure memory usage is optimized.

supercoder-dev added 3 commits June 13, 2024 18:52

Update mixer_seq_simple.py

c6ec120

Update mamba_simple.py

000b310

Update mamba2.py

2100caf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue 387 resolved #395

Issue 387 resolved #395

supercoder-dev commented Jun 13, 2024

Issue 387 resolved #395

Are you sure you want to change the base?

Issue 387 resolved #395

Conversation

supercoder-dev commented Jun 13, 2024